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Amendments to the Specification : 

Replace paragraph [0013], page 4, with the following amended paragraph: 

In certain embodiments of this aspect of the invention, the computer program 
product also includes instructions for causing the computer to: assign a score according to 
degree of relevancy of the corresponding contents to the information sample; and request 
a different input if the score is below a pre-determined threshold. Assigning the score 
includes: compar e comparing the topic keyword and the layout of the corresponding 
contents with those of the information sample to determine the degree of relevancy. 
Assigning the score also includes: compar e comparing the domain keyword, the semantic 
category, the event, and the layout of the corresponding contents with those of the 
information sample to determine the degree of relevancy. 

Replace paragraph [0014], page 4, with the following amended paragraph: 

Embodiments may have one or more of the following advantages. With these aspects of 
the invention, a user can combine any portion of web pages from any w e b sit e s web site to form 
a personalized web page efficiently and effectively. The web page can be displayed in a single 
browser window on a client machine, allowing the user to easily locate information of interest 
provided by different web sites. The method, system, and the computer product are adaptive to 
the ever-changing design of web sites. WebScript may adopt the XML format, and the user's 
annotation may be recorded in XML files. 
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Replace paragraph [0030], page 8, with the following amended paragraph: 

The layout of information object 24 provides the position and boundary of information 
object 24. The layout does not convey the meaning of information object 24; rather, the layout 
contains the format or the arrangement of the information object. For example, the layout of a 
paragraph of text includes the HTML tags in the paragraph and the positions of the tags relative 
to the domain keywords. The layout of an HTML table, on the other hand, includes the HTML 
tags and the dimensions of the table. If the layout has a homogenous style, such as the layout of 
object text 26b in Example 2.2 of FIG. 2A FIG 2B. the task of identifying the layout and 
updating information object 24 will be greatly simplified. 



Replace paragraph [0034], page 9, with the following amended paragraph: 

Identification engine 32 can also extract information about information object 24 from 
categorizing the semantics of certain words or phrases in the information object. In Example 2.3, 
the days of the week: Sat., Sun., Mon., Tues., and Wed., and the types of the weather: cloudy, 
sunny, and rain y (not shown), represent two semantic categories. The semantic categories are 
derived from a semantic tree 39, which includes a large collection of categories of words that 
have similar or identical meanings. 

Replace paragraph [0036], page 10, paragraph 0036 with the following amended 
paragraph: 

An example of the WTI for the input shown in Example 2.1 of FIG 2 FIG 2 A is as 
follows. 
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Replace paragraph [0043], page 12, with the following amended paragraph: 

Referring to FIG 4, a process 400 is shown for generating the WTI for an input. System 
10 receives the input and examines the information submitted for the first information object 24 
(box 40). Identification engine 32 extracts the WTI for the information object 24 from the 
submitted information (box 41). System 10 then determines if there is any remaining 
information object 24 in the input (box 42). If there is at least one remaining information object 
24, system 10 repeats the operations of boxes 41 and 42 until all the information objects 24 in the 
input are examined (box 43). Otherwise, system 10 completes process 400 box 41) (box 44). 



Replace the abstract at page 19 with the following amended abstract: 

T e chniqu e s for updating U pdating a personalized web page ar e d e scrib e d. Th e 
t e chniqu e s id e ntify b y identifying c haracteristic features and an information sample from an 
input^Fh e, in which the input specifies a web site that provides the information sample. When 
an update is required by a user-specified frequency, corresponding contents of the web site is 
d e t e rmin e d are determined b ased on the characteristic features. If the corresponding contents is 
highly are highly relevant to the information sample, the t e chniques e xtract th e corresponding 
contents are extracted, and update-the personalized web page is updated w ith the corresponding 
contents. 



